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Abstract 

In this paper, we present a framework based on a simple data structure and parameterized algorithms 
for the problems of finding items in an unsorted list of linearly ordered items based on their rank 
(selection) or value (search). As a side-effect of answering these online selection and search queries, we 
progressively sort the list. Our algorithms are based on Hoare's Quickselect, and are parameterized based 
on the pivot selection method. 

For example, if we choose the pivot as the last item in a subinterval, our framework yields algorithms 
that will answer q < n unique selection and/or search queries in a total of 0(n log?) average time. After 
q = Q(n) queries the list is sorted. Each repeated selection query takes constant time, and each repeated 
search query takes O(logn) time. The two query types can be interleaved freely. 

By plugging different pivot selection methods into our framework, these results can, for example, 
become randomized expected time or deterministic worst-case time. 

We extend the algorithms and data structures in our framework to obtain results that are cache- 
oblivious I/O efficient and/or dynamic and/or compressed. 

Our methods are easy to implement, and we show they perform well in practice. 

1 Introduction 

Sorting and searching are two of the most fundamental problems in Computer Science; they have been 
studied extensively for decades 8, 20 . In this paper, we present a framework that includes algorithms based 
on Hoare's well-known quickselect algorithm [TS1 HH Qj|] together with a simple bitvector data structure for 
performing online selection and search queries on an initially unsorted list of linearly ordered items. The list 
is progressively sorted as queries are answered, with the result that subsequent queries become less expensive 
to process. The parameter in our framework is the method used to select the pivot element for partitioning, 
and varying this method leads to different running time bounds. 

In the literature, there are many algorithms for selection and multiple selection (where selection queries 
are known a priori), based on Hoare's quickselect, that largely vary only in how pivots are chosen. See, for 
example, [H [HI HU [22J, [24] [25] [23] . (Note that [18] gives a thorough background on selection and multiple 
selection algorithms.) We show that with our framework, all of these results that are based on quickselect 
can be performed in an online fashion, and can include search queries as well, with no significant penalty 



in the running time analysis. We also show that the algorithms and data structures of our framework 
can be extended to be cache-oblivious I/O efficient, dynamic (allowing a modest number of updates), and 
compressed. 

To be more specific, given a list of n linearly ordered, but unsorted items as input in the form of an 
array, we show how to use a simple bit vector data structure to keep track of all previously chosen pivots 
that now occupy their correct position in the array. By first checking this bit vector, a selection query can 
start its work in the unsorted subinterval of the array where its answer lies. Our searching algorithm works 
in the same way - it first performs a binary search to find the unsorted subinterval holding the answer, and 
then uses a slightly modified quickselect to search from there. It should then be obvious that the search and 
select query algorithms are furthering the work of a quicksort algorithm in stages, performing exactly the 
same work, though perhaps not in the same order, and hence having the same overall complexity (ignoring 
the time spent manipulating the bit vector). 

Let q be the number of distinct selection and/or search queries (repeated queries take relatively insignif- 
icant time). By varying our method for choosing pivots, we can make use of the analyses performed by 
previous researchers to obtain results such as these: 

• If we choose the last item of a subinterval as the pivot (as is done in a typical undergraduate data 
structures textbook) we show that q queries take 0(n log q) deterministic average time^U 

• If we use a median-of-medians approach to choosing pivots (such as is done in [5]) we obtain an 
0(n\ogq) deterministic worst-cast time. 

• If we use the randomized pivot selection algorithm as described in [TS] we obtain an 0(n log q) expected 
running time, and furthermore, we also match their bound of B + 0{n) expected comparisons, where 
B is the information-theoretic minimum number of comparisons as they define it, if we allow only 
selection queries. 

1. Note: To include search queries in this result, it suffices that we have at most 0(nj logn) searches 
in order to absorb the cost of binary search in the 0(n) error term, and the rank of the item 
searched for is used in computing B. However, we are then treating search queries as selection 
queries, with added cost, for defining the information-theoretic minimum, and this surely is not 
correct. Addressing this issue is beyond the scope of this paper. 

2. Note: [TB] also presents a deterministic multi-selection algorithm, not directly based on quickselect, 
that uses B + o(B) + 0(n) comparisons were B is as before. Our online framework, as presented 
here, does not directly apply to this algorithm, but with some significant modifications it does, 
and we will describe how that is done in the full version of this paper. 

We present our selection algorithm in Section 12.41 including a brief proof of the average case analysis men- 
tioned above. Search queries are described in Section 12.61 Our framework is easy to implement, and we 
present some preliminary experimental results in Section [SJ 

Sibeyn [27] briefly considered multi-selection in external memory |29j , but was not able to achieve optimal 
bounds in either the a priori case or the online case. Franceschini and Grossi [llj recently improved upon 
a long line of work (listed in their paper) and proved that unsorted ordering for multidimensional keys can 
yield optimal search time. 

• We are able to extend our framework such that our algorithms are cache-oblivious I/O efficient. This 
is discussed in Section [3] 

• We allow updates: for e > 0, each update operation costs <3((l/e)n e ) time, so that we can allow about 
n 1_€ online insertion and/or deletion operations without significantly affecting our overall running 
time. This is described in Section 0J 

• We are also able to compress our data structures, again in an online fashion, such that as queries are 
processed, the space used by our data structures shrink. This is described in Section [5l 

1 In this paper, we use the notation log^ a = (log 6 a) c = (log a/ log b) c to denote the cth power of the base-6 logarithm of a. 
If no base is specified, the implied base is 2. 
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2 The Framework 



In this section, we describe the algorithms in our framework. The input is an unsorted list or array A of 
linearly ordered items. We assume A contains no duplicates. As successive queries are answered, the items 
in A are rearranged such that, over time, A becomes sorted. Let n denote the length of A, with the ith item 
at k[i — 1]. 

2.1 Pivot Function. 

We assume a pivot selection function, pivotQ, for use with quickselect, has been provided. pivotQ takes 
as inputs A and integers x,y with < x < y < n, and returns with the pivot, chosen from k[x], . . . , k[y], 
swapped into k[y]. This function may rearrange the data in this portion of A, but no other changes are made. 
The empty function is a valid pivot function. 

2.2 Preprocessing Phase. 

Create a bit vector B also of length n, initialized to zeroes. Then find the minimum and maximum items in 
A and swap them to positions A[0] and k[n — 1] respectively. Then set B[0] = B[n — 1] = 1. This takes linear 
time. 

2.3 Bit Vector Invariant. 

If B[i] = 1, then for j < i, we have k[j] < k[i], and for j > i, we have A[j] > k[i). Obviously, this invariant 
holds after our preprocessing phase. 

2.4 Selection Query Algorithm (qselect) 

Our input is an integer k, with < k < n, and the goal is to find an item a from A such that if A were 
sorted in non-decreasing order, a = k[k]. We also rearrange the items in A so that, in fact, a = k[k], and also 
B[fc] = 1 while maintaining the invariant. 

1. If B[fc] = 1, return a = k[k] and terminate. 

2. Scan B to the left and right of position k to find bit positions I and r such that £ < k < r with 
B[£] — B[r] — 1 and, for every i, I < i < r implies B[i] = 0. The preprocessing phase guarantees £ and 
r exist. 

3. Perform Hoare's recursive quickselect algorithm on the list k[£ + 1], . . . , A[r — 1]. (So pivot() is called 
using x = I + 1 and y = r — 1.) Whenever a pivot element is placed in its correct position in A by the 
partition function, set the pivot's corresponding bit to 1 in B. This preserves the invariant. 

4. Hoare's algorithm will find and return a as described above. 

Note that queries with a value of k that is repeated or, by happenstance, is a position of a previously chosen 
pivot, are processed in constant time with no data comparisons. 

2.5 Average-Case Analysis 

Here we apply our framework with a pivot selection function that merely returns k[y] as the pivot, and 
determine the running time, averaged over permutations of the input, of processing q distinct selection 
queries. 

The key to our analysis is to observe that, as queries are answered, the algorithm is doing exactly the 
same work, in the sense that it chooses the same pivots from subintervals of the same position and size, as 
the quicksort algorithm [16] , although not necessarily in the same order. 

Let Hk denote the A;th harmonic number, and recall that Hk — O(logfc). Let T(n,q) denote the average 
number of item comparisons performed to answer q distinct selection queries on a list A of length n, after 
preprocessing, using qselect with the trivial pivot function. 
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Theorem 1 T(n,q) < 2nH q = 0(n log q). 

Proof: On a random input permutation of n items, the q queries can be split over a pivot in q + 1 ways (k 
to the left, q — k to the right, < k < q). 

We ignore the case when one of the q queries matches a pivot. Such a query gets processed in constant 
time, and as q < n, this is at most a linear cost total, using no item comparisons, so we can simply rule 
them out and use a smaller value for q that only counts queries that are not previous pivots. Using this 
simplification, we get the recurrence 

j n-1 q 

T(n ' l) = n + ZJTTu E E^ k ) +T(n-j-l,q- k)). 

+ j=0 k=0 

A simple change of variables give us 

2 ™ _1 q 

vy ; j=0 fe=0 

Observe that if we set q = 1 and recall that T(n, 0) = 0, we obtain the recurrence for Hoare's quickselect 
algorithm. 

Dropping the T(j, 0) terms, which are zero, this leaves us with 

2 ™ _1 q 

We now show, using induction on n, that T(n, q) < 2nH q . 

2 1 n—l 2 q 11-1 

vy ; fe=l j=0 vy ' fe=l j"=0 

2n 9 2n 

< n +(^Y)E^ = »+^iy((9 + W-«) ^ 2n ^- 

This follows, since ^=0 H i = ( k + l ) H k - k and 1/2 < + 1) < 1. □ 
2.6 Search Query Algorithm (qsearch) 

Our input is a value a, and the goal is to compute an integer k, if one exists, such that a = k[k], with A 
rearranged so that a is in its correct position if A were sorted. (In other words, we will have B[fc] = 1 with 
the invariant preserved.) 

1. Binary Search Step: Starting with A[0] and A[n — 1] as the left and right endpoints (low = 0,high = 
n—l), perform the steps of binary search in A for a, ignoring the fact that A is not sorted. Continue 
until binary search terminates normally, either by finding a at position i or finding a position i such 
that k[i] < a < k[i + 1]. By the invariant, we have either found a, or we have located an unsorted 
subinterval containing a if it is present. 

2. qselect Step: Proceed as in the qselect algorithm; scan left and right of i to identify I and r, and 
perform a recursive quickselect using our chosen pivot () function. We choose which side of the pivot 
to recurse on based on the value of a relative to the pivot. 

We consider a repeated search query one that takes the binary search step to the same position in the 
array; this can happen even if the query is not for a repeated value. For repeated searches, the total time is 
O(logn) for the binary search, and we never perform the qselect step. 

The decision not to use the value of a to partition is deliberate and necessary for search queries to be 
treated, from an algorithm analysis perspective, as selection queries with O(logn) additional work. For q 
search queries, that additional work is O(glogn) for the binary search steps, which is dominated by the 
O(nlogg) time done by qselect (either in the second step or as a selection query). 
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3 Online Cache Oblivious Search and Selection 



In this section, we describe an extension to the algorithms for select and search described in Sections 1 2 . 41 — [2T6l 
for the cache-oblivious external memory model |29j . To achieve this, we introduce a simple data structure 
called a pivot tree, which stores an up-to-date list of pivots that have been chosen so far. This tree is used 
to maintain the search query in reasonable time — select does not require the pivot tree at all. The overall 
algorithm remains relatively simple and is similar to the one described for the internal memory model earlier. 
We begin with some brief notation for the external memory model. 

External Memory Model. In the external memory model, the computer is abstracted to consist of two 
memory levels: the internal memory of size M, and the (unbounded) disk memory, which operates by reading 
and writing data in blocks of size B. We refer to the number of items of the input by N . For convenience, 
we define n — N/B and m = M/B as the number of blocks of input and memory, respectively. We make the 
reasonable assumption that 1 < B < M/2. In this model, we assume that each I/O read or write is charged 
one unit of time, and that an internal memory operation is charged no units of time. The cache- oblivious 
model measures performance the same way, but the algorithm has no knowledge of M or B. To achieve the 
optimal sorting bound of SortlO(N) = 0(nlog m n) in this setting, it is necessary to make the tall cache 
assumption [B]: M = il(B 1+e ), for some constant e > 0. In practice, M — fl(B 2 ), and we will make this 
assumption for the remainder of the paper. 

3.1 Pivot Tree 

Let the pivot tree P be a dynamic cache-oblivious -B-tree [3] HI E] that maintains one entry for each element 
of A that has been chosen as a pivot. This is essentially achieved by maintaining a simple binary search tree 
laid out in van Emde Boas [28j format, augmented with ordered-file maintenance on its leaves. This tree 
supports insertions and deletions in 0((logN)/B + log B N) I/Os and searches in 0(log B N) I/Os. Each 
pivot selected by our algorithm is inserted into P; we will show that for every t(n) processing time, we 
insert p — 0(y/t(n)) pivots, with an overall cost of o(t(n)) additional time. We will use this tree primarily 
to support the search query. Our algorithms are similar to the ones we developed for the internal memory 
case. We describe differences below: 

• During the preprocessing phase, insert A[0] and A[n — 1] into P. 

• Add the following condition to the invariant from Section [2~3l For each array position i, if B[i] = 1, P 
contains the value k[i]. 

The select query is nearly identical, except that all pivots discovered in a query are also inserted into P. 
This maintains the invariant for select. The search query performs its "binary search" phase on P instead 
of directly on the array A. Once the unsorted subintcrval is found that contains the query item a, we resort 
to the select algorithm on that subinterval as before, again inserting pivots into P as they are found. 

3.2 Pivot Selection 

We partition the set of items into disjoint subsets such that the invariants are maintained for A and B. In 
internal memory, we choose one new pivot at each recursive step, and take 0(logN) total recursions, each 
of which takes linear time. We achieve good bounds, since our internal memory algorithm chooses the same 
pivots as the ones required for quicksort, although not in the same order. For the I/O case, we need to limit 
the number of recursive steps our algorithms take to achieve SortlO(N) = 0(nlog m n) I/Os in total (after 
q = ii queries), which is the sorting lower bound in external memory. To this end, we will choose p — 1 
partitioning elements so that the partitions are of roughly equal size. When that is the case, the bucket size 
decreases from one level of recursion to the next by a relative factor of 0(p), and thus there are 0(log p n) 
levels of recursion. 

Now we reason about the size of p. As the items stream through internal memory, they are partitioned 
into p buckets in an online manner. Specifically, when a buffer of size B fills for one of the buckets, its 
block can be output to disk, and another buffer (block) takes its place. Hence, the maximum number p 
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of partitioning elements is 0(m) (i.e., the maximum number of buffers that will fit in memory). Here, the 
number of levels of recursion is 0(log m n). In the final level of recursion, there is no point in having partitions 
with fewer than ©(to) blocks (since we can sort them for free), so we can limit p to be 0{n/m). These two 
constraints suggest that the desired number p of partitioning elements is 6(min{m, n/m}) = <d(y/n). 

Probabilistic methods for choosing partition elements based on random sampling [9j are simple and allow 
us to choose p = 0(min{m, n/m}) — 0(y / n) partitioning elements cache-obliviously in o{n) I/Os. We briefly 
sketch the idea: Let d = 0(\ogp). We take a random sample of dp items, sort the sampled items, and then 
choose every dth item as a partitioning element. Each resulting partition has the desired size of 0(N/p) 
items. The number of I/Os needed to choose these partitioning elements is 0(dp + SortlO(dp)). Since 
p = 0{yfn) does not depend on M or B, the I/O bound is 0(\/n\og 2 n) — o(n) for choosing pivots in this 
way, and is therefore negligible. 

Theorem 2 Given an array A of N unsorted items, there exists an online cache- oblivious data structure 
that can support q select and search queries in an online fashion, such that select takes 0(q + nlog m q) time 
over q unique queries, and search takes 0(nlog m q + qlog B N) time over q queries, when q < N. 

Proof: To bound select correctly, we focus on a single step of the distribution sort, where p — 0(y/n) 
partitioning elements are chosen using 0(y/n\og 2 n) I/Os. In addition, we must insert each of these pivots 
into the pivot tree, requiring 0(^/n\og B N) I/Os. Placing unsorted blocks in the correct partitions requires 
0(n) I/Os. Overall, we spend 0(n + y / n(log 2 n + log B N)) = 0(n) I/Os. Since this 0(n) cost is part of the 
nlog TO <7 term from distribution sort, the remaining costs are all lower order terms. Finally, we must spend 
at least one I/O per select, contributing an additive 0(q) I/Os. The search bound includes an additional 
0(log B N) I/Os for traversing the pivot tree for each of the q queries. □ 

4 Allowing Updates 

In this section, we extend our results for both internal memory and cache-oblivious external memory to the 
dynamic case. Recall that we are originally given the unsorted list A. Let A' denote the current list. We want 
to support the following two additional operations: 

• insert (a) inserts a into A'; 

• delete(\) deletes the ith (sorted) entry from A'. 

Our solution uses the dynamic bit dictionary Bitlndcl data structure from [14j . We define the following 
set of supported operations on a dynamic bitvector B' of current length n' and original length n. The 
bitvector B' is stored implicitly, and is primarily used for the sake of discussion. In the rest of this section, 
we refer to b as either or 1, depending on context. 

• rankb(i) tells the number of b bits up to the ith position in B'; 

• selectb(i) gives the position in B' of the ith b bit; 

• insertb(i) inserts the bit b in the ith position; 

• delete(i) deletes the bit located in the zth position; 

• flip{i) flips the bit in the ith position. 

Note that one can determine the ith bit of B' by computing rank%(i) — rank\(i — 1). (For convenience, we 
assume that rankb{— 1) = 0.) 

Lemma 3 ([14]) Given a bitvector B' with length n' and original length n, Bitlndel is a cache- oblivious 
data structure that takes 0(n') bits and supports rank and select in 0(log„ n') time, and insert and delete 
in 0(n e log„n') amortized time. When n' = o(n), the bounds become 0(1) and 0(n e ) respectively, for any 
constant < e < 1. 
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We will use a similar technique to dynamize both internal and external memory scenarios. We will 
maintain the list A (although we will swap values as usual) and retain the bitvector B that marks which 
entries of A are pivots. We define an insert bitvector I such that = 1 if and only if A'[z] is newly inserted. 
Similarly, we define a delete bitvector D such that if D[i] = 1, the zth item in A has been deleted. If a newly 
inserted item is deleted, it is removed from / directly. Both / and D are implemented as instances of the 
Bitlndel data structure. 

We will also use a data structure T that maintains the value and position in A' of all newly-inserted 
items. We manage T similarly to what was done in p3] for the Bitlndel data structure. We define T to be 
a Weight-Balanced B-tree (WBB) with fanout between [ro e ,2n e ] for a fixed e > 0. The leaves of this tree 
maintain contiguous chunks of an array of all newly-inserted items, in value (and thus position) order. Each 
entry in the array maintains both its value and its correct position in the array A' . The tree T is balanced 
by array index among newly-inserted items, or in other words, the rank of each item in the array. 

Each internal node of the tree maintains a prefix-sum data structure that can quickly direct a query to 
the appropriate branch. This can be done in 0(1) time per node according to [Tf] , At the bottom level, 
each "leaf" node is an array of contiguous positions; we access them directly in 0(1) time as usual. When 
we update T, it takes at most 0((l/e)n e ) time, as per the discussion in [T¥j . 

Our preprocessing steps are the same as the static cases. The bitvectors I and D are storing the implicit 
bitvector of n Os. The WBB tree T is initially empty. Next, we describe how to support A' .insert (a), 
k'.delete(i), A' .select(i), and A' .search(a) for the internal memory case. 

4.1 Internal Memory Dynamic Online Selection and Search 

A' .insert(a). First, we search for the appropriate unsorted interval [£, r] containing a using a binary search 
on the original (unsorted) array A. Now perform qsearch(&) on interval [£,r] (choosing which subinterval to 
expand based on the insertion key a) until a's exact position t in A is determined. The original array A must 
have chosen as pivots the elements immediately to its left and right (positions t—1 and t in array A); hence, 
one never needs to consider newly-inserted pivots when choosing subintervals. Insert a in sorted order in T 
among all other newly-inserted items. Calculate t' = I.selecto(t), and set a's position to t" = t' — D.ranki(t'). 
Finally, we update our bitvectors by performing I .insert±(t") and D.inserto(t"). 

A' .delete(i). First, we compute i' = D.selecto(i). If i' is newly-inserted (i.e., I[i'\ — 1), then we scan 
through T and remove the item with position %' — D.ranki(i'), and reduce the position of all newly-inserted 
items after that position by 1. Then wc perform I .deleteii!) and D.delete(i'). If instead i' is an older entry, 
we update T to reduce the position of all newly-inserted items after position i' — D.rank\(i') and perform 
D.flipii 1 )® 

A 1 .select (i). If I[i] = 1, go to the t — I.rank±(i) entry of T and report the value there. Otherwise, compute 
t = I.ranko(i) — D.rank\(i). Now, perform the qselect(t). 

A' .search(a). First, we search for the appropriate unsorted interval [£, r] containing a using a binary search 
on the original (unsorted) array A. Then, perform qsearch(a) on interval [£, r\ until a's exact position t is 
found. If a appears in array A (which we discover through qsearch), we need to now check whether it has 
been deleted. We compute t' = I.selecto(t) and t" = t' — D.ranki(t r ). If D[t'] — 0, return t" . Otherwise, 
it is possible that the item has been newly-inserted. Compute p = I.ranki(t'), which is the number of 
newly-inserted items. If T[p — 1] = a or T[p] — a, return its position (which is stored in T). Otherwise, 
return failure. 

Time Complexity. Since every operation except delete generates pivots by calling qselect or qsearch, they 
contribute to the O(nlogg) overall time required over q queries. Each access to T is done with an explicit 
rank, and so these queries are finished in 0(1/ e) time each. The functions delete and insert contribute an 

2 If a user wants to delete an item with value a, one could simply search for it first to discover its rank, and then delete it 
using this function. Curiously enough, this means we can delete items that haven't yet been placed correctly! 
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additive 0((l/e)n e ) time per operation to perform Bitlndcl insertions and deletions, as well as manipulation 
(and possible rewriting) of T. Thus, we arrive at the following theorem. 

Theorem 4 Given a dynamic array A' of n original items and n' = o(n) current items, there exists a 
dynamic online internal memory data structure that can support q < n select and search queries in an online 
fashion, such that select takes 0{n\ogq) time over q unique queries and search takes 0(n\ogq + qlogn) 
time over q queries. Insertions and deletions to A take 0((l/e)n e ) time each. 

4.2 Cache-Oblivious Dynamic Online Selection and Search 

For this, we augment our data structure in Section [3] with the WBB tree T on newly inserted items. We can 
implement T in a cache-oblivious way using a van Emde Boas layout, together with ordered-file maintenance 
on leaves. The prefix-sum data structure in each node of T is cache-oblivious [B ■ Since all of the data 
structures above are already cache-oblivious, it is sufficient to show that we do not incur any unnecessary 
I/Os during our search and select operations. Moreover, when we rewrite entire portions of T, these portions 
are contiguous blocks of memory, and require no more than 0((1/ Be)n £ ) I/Os. 

Theorem 5 Given a dynamic array A' of N original items and N' = o(N) current items, there exists a 
dynamic online cache- oblivious data structure that can support q < N select and search queries in an online 
fashion, such that select takes 0{q + nlog m q) I/Os and search takes 0(n log m q + q log B N) I/Os over q 
unique queries. Insertions and deletions to A take 0({1/Be)n e ) I/Os. 

Proof: Since we are choosing the same pivots as the external distribution sort from Section [31 we are doing 
our initial search on the pivot tree P, and all data structures and their access is contiguous, the theorem is 
proved. □ 



5 Online Data Compression 

In this section, we briefly outline results to expand the static online selection problem to compress and sort 
simultaneously. To do this, we group chunks of log 2 n items together into a BSGAP [T3J data structure. In 
this structure, one can encode the underlying data in terms of the "gaps" between contiguous items, and 
still binary search the items contained therein. 

We describe our data structure in a top-down way. At the top level, we will build a pivot tree, implemented 
as an augmented Andersson-Thorup predecessor data structure [2] containing no more than 0(n/ log 2 n) 
pivots. Each pivot will maintain its rank among other items, in addition to its value. (This rank information 
can be maintained within the exponential tree.) Overall, the pivot tree will require 0(n/ log n) = o(n) bits 
in total. This pivot tree is initially empty. 

As the pivot () function is called and we promote items to pivots, we build BSGAP data structures if 
we encounter an unsorted interval of size 0(log 2 n). The root of this BSGAP structure is "gap encoded" 
according to its left or right endpoint of the interval, as appropriate. Furthermore, we maintain a pointer 
from the pivot to this BSGAP structure using O(logn) bits of space. 

If the interval is larger, we must partition the elements according to the pivot element and recurse. During 
this process, we encode each item according to its best ancestor. This best ancestor must either be the left 
or right endpoint (pivot) of its unsorted interval; this information can be stored with a single bit. 

We defer the details until the full version of the paper; these results also have a dynamic analogue, 
although not as compressed. We present some results for the average case analysis in the static case. 

Theorem 6 Given an array A of n unsorted items drawn from a universe of size u, there exists an online 
data structure that can support q select and search queries in an online fashion, such that search and select 
require 0(AT(u,n) + nlogq) time over q queries, when q < n. In addition, the space of the data structure 
eventually diminishes to gap + 0(n log log(u/n)) bits as a natural consequence of answering queries. 
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6 Implementation Results 



In this section we present some preliminary experiments. Our goal is to show that the qselect and qsearch 
algorithms are practical and reasonably efficient. 



qselect. In the table below, the first row gives the times for sorting using qselect by using the ranks 
k = 1,3,5, . . .. This should be competitive, but slower than, quicksort. The second row shows results for 
y/n randomly chosen selections using qselect; this should take less time than sorting. (0(n log q) time with 
q = \pa versus 0(n\ogn) time). Finally, we timed Quicksort. Our results are as expected. 



Method 



10 5 



10 b 10 7 



k = 1, 3, 5, . . . to sort 
^fn selections 
Quicksort 



0.021 0.240 2.78 
0.010 0.120 1.32 
0.016 0.188 2.16 



For all methods, we used lists of n randomly chosen integers from the range 0, . . . , n — 1. Times are reported 
in average number of CPU seconds. We performed the experiments on an Intel Pentium D, 3.0GHz with 
1MB of cache, running Linux and using the GNU g++ compiler with level 2 optimization. We did not 
bother to pack our bit vector. 



qsearch. In the table below, we compare the times for two algorithms for online searching: 

• Old Method: Perform a quicksort, then use interpolation search to find items. 

• New Method: Use qsearch, with interpolation search in place of binary search. 



Search Count Method 

q = Old 
New 

q = n/10 Old 
New 



10 5 10 6 10 7 

0.0164 0.198 2.22 

0.0100 0.114 1.27 

0.0196 0.240 2.87 

0.0192 0.263 3.12 



Search keys were chosen uniformly at random. Using interpolation search in place of binary search should 
favor the Old Method, as the data are uniformly distributed. The results we obtained using binary search 
were similar to what we report here. We obtained times for q = ^fn searches, which should favor our new 
algorithm (like above), and for q = n/10 searches, which should favor the old algorithm, since we are almost 
completely sorting the list. 
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